Applying Perceptual Distance to the Discrimination of Sounds

Author

  • Preeti Rao
Abstract

In sound compression and synthesis, it is valuable to have objective distance measures that predict the perceived dissimilarity of two sounds. It is also desirable to estimate the extent of that dissimilarity, or the quality difference between the sounds. Because it is the perceived difference that must be quantified, measures derived from models of hearing can be expected to give the best predictions. This paper presents an overview of auditory modelling and describes an objective measure based on a distance metric applied to auditory excitation patterns. This perceptual measure is compared with a traditional spectral distortion measure in predicting the discrimination of spectral envelope distortions of vowel-like sounds. The results of a subjective experiment indicate that the perceptual measure is significantly superior.
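The contrast between the two kinds of measure can be illustrated with a minimal sketch. The abstract does not specify the auditory model or the distance metric, so everything below is an assumption made for illustration: the excitation pattern is approximated with a bank of symmetric roex(p) filters on an ERB-spaced frequency grid (using the Glasberg and Moore ERB formula), and both measures are taken as the RMS difference in dB. All function and variable names are hypothetical.

```python
import math

def erb_hz(fc):
    # Equivalent rectangular bandwidth in Hz (Glasberg and Moore formula; assumed model).
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)

def hz_to_erb_number(f):
    return 21.4 * math.log10(4.37 * f / 1000.0 + 1.0)

def erb_number_to_hz(e):
    return (10.0 ** (e / 21.4) - 1.0) * 1000.0 / 4.37

def center_frequencies(f_lo, f_hi, step=0.5):
    # Filter centre frequencies spaced `step` ERB units apart.
    e_lo, e_hi = hz_to_erb_number(f_lo), hz_to_erb_number(f_hi)
    n = int((e_hi - e_lo) / step) + 1
    return [erb_number_to_hz(e_lo + i * step) for i in range(n)]

def roex_weight(f, fc):
    # Symmetric roex(p) filter power weight; p = 4 * fc / ERB(fc).
    p = 4.0 * fc / erb_hz(fc)
    g = abs(f - fc) / fc
    return (1.0 + p * g) * math.exp(-p * g)

def excitation_pattern_db(freqs, levels_db, centers):
    # Output level (dB) of each auditory filter for a sum of sinusoidal components.
    pattern = []
    for fc in centers:
        power = sum(10.0 ** (lv / 10.0) * roex_weight(f, fc)
                    for f, lv in zip(freqs, levels_db))
        pattern.append(10.0 * math.log10(power))
    return pattern

def rms_difference(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))

def spectral_distortion_db(levels_a, levels_b):
    # Traditional measure: RMS difference of the component levels in dB.
    return rms_difference(levels_a, levels_b)

def excitation_distance_db(freqs, levels_a, levels_b, centers):
    # Perceptually motivated measure: RMS difference of the excitation patterns (assumed metric).
    return rms_difference(excitation_pattern_db(freqs, levels_a, centers),
                          excitation_pattern_db(freqs, levels_b, centers))

# Hypothetical vowel-like stimulus: 40 harmonics of 100 Hz, all at 60 dB.
freqs = [100.0 * k for k in range(1, 41)]
flat = [60.0] * len(freqs)
low_mod = list(flat)
low_mod[2] += 3.0     # raise the 300 Hz harmonic by 3 dB
high_mod = list(flat)
high_mod[29] += 3.0   # raise the 3000 Hz harmonic by 3 dB

centers = center_frequencies(100.0, 4000.0)
sd_low = spectral_distortion_db(flat, low_mod)
sd_high = spectral_distortion_db(flat, high_mod)
ed_low = excitation_distance_db(freqs, flat, low_mod, centers)
ed_high = excitation_distance_db(freqs, flat, high_mod, centers)
print(f"spectral distortion: low={sd_low:.3f} dB, high={sd_high:.3f} dB")
print(f"excitation distance: low={ed_low:.3f} dB, high={ed_high:.3f} dB")
```

In this toy setup the spectral distortion cannot tell the two modifications apart, whereas the excitation-pattern distance is larger for the low harmonic, which the narrower low-frequency filters resolve individually. This only illustrates the general idea and is not the measure evaluated in the paper.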

Similar resources

Analysis of motor fan radiated sound and vibration waveform by automatic pattern recognition technique using “Mahalanobis distance”

In recent years, as the weight of IT equipment has been reduced, the demand for motor fans for cooling the interior of electronic equipment is on the rise. Sensory test technique by inspectors is the mainstream for quality inspection of motor fans in the field. This sensory test requires a lot of experience to accurately diagnose differences in subtle sounds (sound pressures) of the fans, and t...

Phonological Representations Are Unconsciously Used when Processing Complex, Non-Speech Signals

Neuroimaging studies of speech processing increasingly rely on artificial speech-like sounds whose perceptual status as speech or non-speech is assigned by simple subjective judgments; brain activation patterns are interpreted according to these status assignments. The naïve perceptual status of one such stimulus, spectrally-rotated speech (not consciously perceived as speech by naïve subjects)...

Estimation of Perceptual Spaces for Speaker Identities Based on the Cross-Lingual Discrimination Task

This paper reconfirms that talker identity can be transmitted across languages. Talker discrimination was examined in the ABX paradigm, where the stimuli A and B were utterances by different talkers in the same language and the stimulus X was an utterance by either of A or B in the different language. The average hit rate of this discrimination task was as high as 0.89. The mutual distance matr...

Pathologies cardiac discrimination using the Fast Fourier Transform (FFT), the short time Fourier transform (STFT) and the Wigner distribution (WD)

This paper is concerned with a synthesis study of the fast Fourier transform (FFT), the short time Fourier transform (STFT) and the Wigner distribution (WD) in analysing the phonocardiogram (PCG) signal, or heart sounds. The FFT can provide a basic understanding of the frequency contents of the heart sounds. The STFT is obtained by calculating the Fourier tran...

A measure for predicting audibility discrimination thresholds for spectral envelope distortions in vowel sounds.

Both in speech synthesis and in sound coding it is often beneficial to have a measure that predicts whether, and to what extent, two sounds are different. This paper addresses the problem of estimating the perceptual effects of small modifications to the spectral envelope of a harmonic sound. A recently proposed auditory model is investigated that transforms the physical spectrum into a pattern...

Publication date: 2002